Qi Cao

ATLAS: Agentic Test-time Learning-to-Allocate Scaling

Jun 01, 2026
Via arXiv

Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense

May 29, 2026
Via arXiv

AIBuildAI-2: A Knowledge-Enhanced Agent for Automatically Building AI Models

May 27, 2026
Via arXiv

LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling

May 13, 2026
Via arXiv

AIBuildAI: An AI Agent for Automatically Building AI Models

Apr 15, 2026
Via arXiv

Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Mar 20, 2026
Via arXiv

DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

Mar 17, 2026
Via arXiv

FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation

Jan 29, 2026
Via arXiv

DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation

Jan 29, 2026
Via arXiv

Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

Jan 29, 2026
Via arXiv